Spotting Japanese CV-Syllables and Phonemes Using Time-Delay Neural Networks
Abstract
Syllable or phoneme spotting, if reliably achieved, provides a good solution to the spoken word and/or continuous speech recognition problem. We previously showed that the Time-Delay Neural Network (TDNN) provided excellent recognition performance (98.6%) on the "BDG" consonant task. We would also like to extend the encouraging performance of the TDNN to word/continuous speech recognition. In this paper, we show techniques for spotting Japanese CV syllables/phonemes in input speech based on TDNNs. We constructed a TDNN that can discriminate a single CV-syllable or phoneme group. In Japanese, there are only about one hundred syllables, or fewer than thirty phonemes, which makes it feasible to prepare and train TDNNs to spot all possible syllables or phonemes extracted as training tokens from training words. Syllable and phoneme spotting experiments show excellent results, including a syllable spotting rate of better than 96.7% correct. These spotting techniques prove to be a good step toward continuous speech recognition.
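The spotting idea in the abstract can be illustrated with a minimal sketch: a small time-delay network is slid along the input frames, and a window is reported wherever one output class scores above a threshold. Everything specific here is an assumption made for illustration and is not taken from the abstract: the function names `time_delay_layer` and `spot`, the layer sizes (16 spectral coefficients, a 15-frame window, 8 and 3 units), the sigmoid activation, averaging over time as the integration step, and the threshold. The weights are random, so the sketch shows only the data flow, not a trained spotter.

```python
import numpy as np


def time_delay_layer(x, weights, bias):
    """Apply one time-delay layer.

    x:       (T, F) sequence of T frames with F features each
    weights: (D, F, H) one weight matrix per delay, D delays, H units
    bias:    (H,)
    Returns a (T - D + 1, H) array of sigmoid activations.
    """
    delays, _, hidden = weights.shape
    steps = x.shape[0] - delays + 1
    out = np.empty((steps, hidden))
    for t in range(steps):
        window = x[t:t + delays]  # D consecutive frames, shape (D, F)
        # Sum over both the delay axis and the feature axis.
        out[t] = np.tensordot(window, weights, axes=([0, 1], [0, 1])) + bias
    return 1.0 / (1.0 + np.exp(-out))


def spot(frames, layers, window=15, threshold=0.5):
    """Slide the network over an utterance (hypothetical helper).

    Returns a list of (start_frame, class_index, score) for every
    window whose best class score reaches the threshold.
    """
    hits = []
    for start in range(frames.shape[0] - window + 1):
        h = frames[start:start + window]
        for w, b in layers:
            h = time_delay_layer(h, w, b)
        scores = h.mean(axis=0)  # integrate the output units over time
        k = int(scores.argmax())
        if scores[k] >= threshold:
            hits.append((start, k, float(scores[k])))
    return hits


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Assumed sizes: 16 features -> 8 units (3 delays) -> 3 classes (5 delays).
    layers = [
        (rng.normal(size=(3, 16, 8)), np.zeros(8)),
        (rng.normal(size=(5, 8, 3)), np.zeros(3)),
    ]
    utterance = rng.normal(size=(40, 16))  # 40 frames of fake features
    for start, cls, score in spot(utterance, layers):
        print(start, cls, round(score, 3))
```

Because the same weights are applied at every time shift, the network does not need to know where a syllable or phoneme begins, which is what makes scanning over continuous input possible in this kind of setup.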
Similar resources
Integrating connectionist, statistical and symbolic approaches for continuous spoken Korean processing
This paper presents a multi-strategic, hybrid approach to large-scale integrated speech and natural language processing, employing connectionist, statistical and symbolic techniques. The developed spoken Korean processing engine (SKOPE) integrates a connectionist TDNN-based phoneme recognition technique with statistical Viterbi-based lexical decoding and symbolic morphological/phonological an...
Decentralized Adaptive Control of Large-Scale Non-affine Nonlinear Time-Delay Systems Using Neural Networks
In this paper, a decentralized adaptive neural controller is proposed for a class of large-scale nonlinear systems with unknown nonlinear, non-affine subsystems and unknown nonlinear time-delay interconnections. The stability of the closed loop system is guaranteed through Lyapunov-Krasovskii stability analysis. Simulation results are provided to show the effectiveness of the proposed approache...
Effects of Mora Phonemes on Japanese Word Accent
The effects of mora phonemes on Japanese word accent were analyzed statistically, using a set of about 124,000 frequently used common nouns derived from the Japanese Word Dictionary edited by EDR (the Japan Electronic Dictionary Research Institute, Ltd., Japan). In this analysis, a Japanese syllable was defined as a preceding consonant (+ semi-vowel) + following vowel, accompanied / not accompanied by...
A Hybrid Stochastic Connectionist Approach to Automatic Speech Recognition
This report focuses on a hybrid approach, including stochastic and connectionist methods, for continuous speech recognition. Hidden Markov Models (HMMs) are a popular stochastic approach used for continuous speech, well suited to cope with the high variability found in natural utterances. On the other hand, artificial neural networks (NNs) have shown high classification power for short speech ut...
Recurrent Neural-Network Learning of Phonological Regularities in Turkish
Simple recurrent networks were trained with sequences of phonemes from a corpus of Turkish words. The network's task was to predict the next phoneme. The aim of the study was to look at the representations developed within the hidden layer of the network in order to investigate the extent to which such networks can learn phonological regularities from such input. It was found that in the differ...